Clinical Practice and Epidemiology in Mental Health
نویسنده
چکیده
Background: The 12-item General Health Questionnaire (GHQ-12) is used routinely as a unidimensional measure of psychological morbidity. Many factor-analytic studies have reported that the GHQ-12 has two or three dimensions, threatening its validity. It is possible that these 'dimensions' are the result of the wording of the GHQ-12, namely its division into positively phrased (PP) and negatively phrased (NP) statements about mood states. Such 'method effects' introduce response bias which should be taken into account when deriving and interpreting factors. Methods: GHQ-12 data were obtained from the 2004 cohort of the Health Survey for England (N = 3705). Following exploratory factor analysis (EFA), the goodness of fit indices of one, two and three factor models were compared with those of a unidimensional model specifying response bias on the NP items, using structural equation modelling (SEM). The hypotheses were (1) the variance of the responses would be significantly higher for NP items than for PP items because of response bias, and (2) that the modelling of response bias would provide the best fit for the data. Results: Consistent with previous reports, EFA suggested a two-factor solution dividing the items into NP and PP items. The variance of responses to the NP items was substantially and significantly higher than for the PP items. The model incorporating response bias was the best fit for the data on all indices (RMSEA = 0.068, 90%CL = 0.064, 0.073). Analysis of the frequency of responses suggests that the response bias derives from the ambiguity of the response options for the absence of negative mood states. Conclusion: The data are consistent with the GHQ-12 being a unidimensional scale with a substantial degree of response bias for the negatively phrased items. Studies that report the GHQ12 as multidimensional without taking this response bias into account risk interpreting the artefactual factor structure as denoting 'real' constructs, committing the methodological error of reification. Although the GHQ-12 seems unidimensional as intended, the presence of such a large response bias should be taken into account in the analysis of GHQ-12 data. Published: 24 April 2008 Clinical Practice and Epidemiology in Mental Health 2008, 4:10 doi:10.1186/1745-0179-410 Received: 4 February 2008 Accepted: 24 April 2008 This article is available from: http://www.cpementalhealth.com/content/4/1/10 © 2008 Hankins; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Clinical Practice and Epidemiology in Mental Health 2008, 4:10 http://www.cpementalhealth.com/content/4/1/10 Page 2 of 8 (page number not for citation purposes) Background The 12-item General Health Questionnaire (GHQ-12) is a self-report measure of psychological morbidity, intended to detect "psychiatric disorders...in community settings and non-psychiatric settings" [1]. It is widely used in both clinical practice [2], epidemiological research [3] and psychological research [4]. The GHQ-12 has been extensively evaluated in terms of its validity and reliability as a unidimensional index of severity of psychological morbidity [5-9]. Many studies, however, have reported that the GHQ-12 is not unidimensional, but instead assesses psychological morbidity in two or three dimensions [10-19]. Several twoand three-dimensional models have been proposed (see Martin & Newell 2005 for review [20]), and to date no study examining the factor structure of the GHQ-12 has found it to be unidimensional. These various factors have been interpreted as substantive psychological constructs such as 'Anxiety', 'Psychological distress', 'Social Dysfunction', 'Positive Health', for example. When these competing models have been compared [10,20], confirmatory factor analysis suggests that the best fitting model is the three-dimensional model of Graetz [21], which proposes that the GHQ-12 measures three distinct constructs of 'Anxiety', 'Social dysfunction' and 'Loss of confidence'. The GHQ-12 was validated on the assumption that it was a unidimensional and generic measure of psychiatric morbidity: if it truly is multidimensional, measuring psychological functioning in three specific domains, then the validation data may be questioned, and hence the use of the GHQ-12 in clinical practice and research contexts. Despite the apparently consistent evidence that the GHQ12 is multifactorial, the findings of these studies could be interpreted as demonstrations of a phenomenon longknown in the psychometric literature: that measure comprising a mixture of positive and negative statements can produce an entirely artefactual division into factors [2227]. Such artefacts, introduced as they are by the method of measurement, are known as 'method effects'. The GHQ-12 itself comprises six items that are positive descriptions of mood states (e.g "felt able to overcome difficulties") and six that are negative descriptions of mood states (e.g. "felt like a worthless person"). For brevity these will be referred to as 'negatively phrased items' (NP items) and 'positively phrased items' (PP items) respectively. Consistent with the hypothesis that these factors derive from method effects, a casual inspection of the two and three factor structures previously reported reveals that they comprise most (or all) of the NP items and most (or all) of the PP items. For example, of the four two factor solutions included in the review by Newell & Martin, two comprise all of the NP items vs. all of the PP items, and two comprise all of the NP items plus one PP item vs. the remaining PP items. Of the three three factor solutions reviewed, the additional factor comprises either two NP items or two PP items, i.e. the division between NP and PP items is maintained. In addition, when reported, the correlations between dimensions are very high; this is again consistent with the hypothesis that the GHQ-12 is unidimensional, and that the apparent twoor three-dimensional structure is artefactual. Hence the question arises of whether the factors identified in these studies truly reflect clinically distinct dimensions of psychological morbidity. If the dimensions identified by these studies are simply due to the measurement bias introduced by method effects, then the constructs identified do not exist (a methodological error known as 'reification' [28]). The analyses so far reported employed either exploratory factor analysis (EFA) to derive the factor structure or confirmatory factor analysis (CFA) to confirm the factor structure of the GHQ-12. EFA cannot in principle distinguish between genuine multidimensional structure and spurious factors generated by method effects. CFA, used appropriately, can model such method effects and so test the hypothesis that the apparent factor structure is artefactual. Unfortunately, studies employing CFA have only tested the models originally generated by EFA; used in this way, CFA is also in principle incapable of distinguishing between a genuine factor structure and an artefactual one. The modelling of the proposed method effect is, however, relatively straightforward. Various mechanisms have been proposed for the general phenomenon of NP and PP items separating into factors, focusing on respondents' difficulties in processing negatively-phrased items. It has been suggested that negatively phrased items are more difficult to process because of inattention, variation in education, or an aversion to negative emotional content [22,24,25]. Whatever the cause, since response bias introduces variation in responses not due to variation in the measured construct, nor to random measurement error (both of which should apply equally to all items), then it may be hypothesised that response bias creates additional variation in negatively-phrased item scores, and that this additional variance is common to the negatively phrased items but not the positively phrased items. Two hypotheses follow from this: (1) NP items should have significantly higher variance than PP items and (2) a unidimensional model incorporating response bias on the NP items should fit the data better than the twoor three-factor models proposed. The aim of this study was therefore to explore the possibility that the previously-reported factor structures of the Clinical Practice and Epidemiology in Mental Health 2008, 4:10 http://www.cpementalhealth.com/content/4/1/10 Page 3 of 8 (page number not for citation purposes) GHQ-12 were due to method effects, namely a response bias on the negatively phrased items. Methods GHQ-12 data were obtained from the 2004 cohort of the Health Survey for England, a longitudinal general population conducted in the UK. Sampling and methodological details are in the public domain [29]. For the purposes of this study, a single adult was selected from each household to maintain independence of data. Of the 4000 such adults, 3705 provided complete data for the GHQ-12. The Likert scoring method was used, with each of the twelve items scored in the range 1 to 4, 4 indicating the most negative mood state. For clarity, the six NP items were labelled n1 to n6 and the six PP items p1 to p6. The variances of all items were computed with 95% confidence limits, the hypothesis being that the variances on the negative questions would be uniformly greater than those on the positive questions. In addition, Pearson correlation coefficients were calculated between all items (producing a correlation matrix). Exploratory factor analysis was first conducted to explore whether the data would replicate either the twoor three-factor solutions previously reported; in the context of this study, this was a necessary step, since in the absence of a multifactorial solution there would be no method effect to explain. Consistent with most of the previous EFA analyses, the principal components method was used, with orthogonal (Varimax) rotation. Following EFA, four models were tested for goodness of fit (CFA) using the structural equation modelling package AMOS 6.0 (maximum likelihood method): 1. Unidimensional: one factor, i.e. the intended GHQ-12 measurement model 2. Confirmatory analysis of the EFA solution 3. Three dimensional: three correlated factors, derived from the Graetz [21] model of three factors measured by 6 items ('anxiety', p1 to p6), 4 items ('social dysfunction', n1 to n4) and two items ('loss of confidence', n5 and n6) 4. Unidimensional with method effects: one factor (essentially Model 1) with correlated error terms on the NP items Models 1 to 3 are typical of the EFA/CFA approach taken to date. Model 4 is the model hypothesised to have the best fit for the data. Path diagrams for all models are presented in Figure 1. As recommended by Byrne [30] a range of goodness of fit indices were computed for model comparison. These indices differ primarily in their treatment of sample size, model parsimony and the range of values considered to indicate a well-fitting model. The goodnessof-fit index (GFI), adjusted goodness-of-fit index (AGFI), normed fit index (NFI) and comparative fit index (CFI) all increase towards a maximum value of 1.00 for a perfect fit, with values around 0.950 indicating a good fit for the data. In contrast, the values of the root mean square error of approximation (RMSEA) and the expected cross-validation index (ECVI) decrease with increasingly good fit, and are not limited to the range zero to one. The ECVI indicates how well the model will cross-validate to another sample, while the RMSEA provides a 'rule of thumb' cutoff for model adequacy of less than 0.08. Results Exploratory factor analysis (EFA) Table 1 shows the Pearson correlation coefficients for all items, with correlations between similarly-phrased items highlighted in bold. Exploratory factor analysis (principal components analysis with Varimax rotation) suggested a two factor solution explaining 59.0% of the total variance in items (factor 1 eigenvalue = 3.9; factor 2 eigenvalue = 3.1). Consistent with the hypotheses of this study, the first factor comprised all of the NP items and the second factor all of the PP items. Table 2 shows the rotated component matrix for all items, illustrating the division between NP and PP items. This two-factor model was further examined by CFA, below (hypothesis 2, model 2). Hypothesis 1: Comparison of the variances of NP and PP items Table 3 shows the variances of all items with 95% confidence limits (see also Figure 2 for a graphical representation of the variances of the items). It can be readily seen that the variances of the NP items (median 0.515) were uniformly higher than those of the PP items (median 0.215), with no overlap in the 95% confidence limits. This supports the first hypothesis that the NP items would have increased variances due to additional error variance caused by response bias. Hypothesis 2: Comparison of models Table 4 shows the goodness-of-fit measures for all models. Of the previously reported models the best fitting was the Graetz three dimensional model; indeed, of models 1 to 3 this was the only model with an acceptable fit (RMSEA = 0.073, 90%CL (0.069, 0.077); ECVI = 0.301, 90%CL (0.274, 0.331). Model 4 was, however, superior in all fit measures to model 3 (RMSEA = 0.068, 90%CL (0.064, 0.073; ECVI = 0.214, 90%CL (0.191, 0.238)), thus confirming the second hypothesis. Exploration of NP response bias Figure 3 shows frequency histograms for the six PP items. It can be seen that most of the respondents indicated the Clinical Practice and Epidemiology in Mental Health 2008, 4:10 http://www.cpementalhealth.com/content/4/1/10 Page 4 of 8 (page number not for citation purposes) presence of positive mood states by responding on point 2 of the four point Likert type scale, indicating that these positive mood states were experienced with the 'Same' frequency as usual (i.e. a lack of pathology was indicated by a PP item score of 2). In contrast, Figure 4 shows the frequency histograms for the six NP items. It can be seen that the same respondents indicated an absence of negative mood states for items n1 to n4 with a response of either 1 Model specification (models 1 to 4) Figure 1 Model specification (models 1 to 4). Model 1: Unidimensional Model 2: Two factor EFA solution Psychological Distress p1 ep1
منابع مشابه
The Devil Is in the Details! On Regulating Cannabis Use in Canada Based on Public Health Criteria; Comment on “Legalizing and Regulating Marijuana in Canada: Review of Potential Economic, Social, and Health Impacts”
This commentary to the editorial of Hajizadeh argues that the economic, social and health consequences of legalizing cannabis in Canada will depend in large part on the exact stipulations (mainly from the federal government) and on the implementation, regulation and practice of the legalization act (on provincial and municipal levels). A strict regulatory framework is necessary to minimize the ...
متن کاملHealthcare Providers’ Awareness and Practice on Electronic Mental Health Interventions for Depression and its’ Utilization Barriers
Introduction: E-mental health interventions can be effective in the early diagnosis and treatment of mental problems such as depression. Therefore, the present study aimed to evaluate the awareness and practice of health care providers on E-mental health interventions and its’ utilization barriers. Method: This descriptive study was conducted in Kerman in 2018. The research population consisted...
متن کاملHealthcare Providers’ Awareness and Practice on Electronic Mental Health Interventions for Depression and its’ Utilization Barriers
Introduction: E-mental health interventions can be effective in the early diagnosis and treatment of mental problems such as depression. Therefore, the present study aimed to evaluate the awareness and practice of health care providers on E-mental health interventions and its’ utilization barriers. Method: This descriptive study was conducted in Kerman in 2018. The research population consisted...
متن کاملClinical Practice and Epidemiology in Mental Health
Epidemilogical and clincial use of GMHAT-PC ( Global Mental Health assessment toolprimary care )in cardiac patients Clinical Practice and Epidemiology in Mental Health 2009, 5:7 doi:10.1186/1745-0179-5-7 Murali Krishna ([email protected]) Peter Lepping ([email protected]) Vimal K Sharma ([email protected]) John R M Copeland ([email protected]) Lorraine Lockwood (lorraine.locwo...
متن کاملEpidemiology of Alcohol Abuse in Iran
Background: World Health Organization report in 2014 indicates a considerable range of alcohol consumption across the world which had a growing trend during last 5 years. A study on 5,231 students in 57 universities has shown that alcohol consumption became more common than other drugs, and alcohol is in the first rank for repeated usage. In another study in 2010, the prevalence of alcohol use ...
متن کامل